Distant Speaker Verification Using a Combined Family of MVDR Estimates
نویسندگان
چکیده
Distant speaker verification involves explicit spectral estimation of speech acquired over microphone arrays. The choice of the appropriate set of microphones is important here. In this paper we describe an implicit approach to minimum variance distortionless response (MVDR) spectral estimation of distant talking speech and its application in distant speaker verification. A mathematical formulation for computing an implicit spectral estimate for speech acquired over a uniform linear array (ULA) is first presented. This formulation is based on a simple mathematical relation between a fixed order MVDR spectral estimate, the harmonics in speech, and the noise power. This relationship is used for spectral modeling of distant talking speech by jointly combining a family of MVDR estimates and the number of elements in the ULA. The performance of the proposed implicit MVDR spectral estimation method is evaluated in terms of cepstral distance measure indicating improvements over the Fourier spectral estimates obtained from the individual elements of the ULA. Experiments on distant speaker verification using speech data from the NIST 2004 corpus indicate reasonable improvements when compared to conventional MFCC from the individual elements from the ULA.
منابع مشابه
Using Exciting and Spectral Envelope Information and Matrix Quantization for Improvement of the Speaker Verification Systems
Speaker verification from talking a few words of sentences has many applications. Many methods as DTW, HMM, VQ and MQ can be used for speaker verification. We applied MQ for its precise, reliable and robust performance with computational simplicity. We also used pitch frequency and log gain contour for further improvement of the system performance.
متن کاملA New Post-filter Algorithm Combined with Two-step Adaptive Beamformer
The optimal microphone array, in the sense of minimum mean square errors (MMSE), includes two processing blocks: the minimum variance distortionless response (MVDR) beamformer and the single-channel Wiener filter, which acts as post-filter. In this paper, we propose a new post-filter algorithm based on assumptions that both the noise power attenuation factor (NPAF) and signal power attenuation ...
متن کاملUsing Exciting and Spectral Envelope Information and Matrix Quantization for Improvement of the Speaker Verification Systems
Speaker verification from talking a few words of sentences has many applications. Many methods as DTW, HMM, VQ and MQ can be used for speaker verification. We applied MQ for its precise, reliable and robust performance with computational simplicity. We also used pitch frequency and log gain contour for further improvement of the system performance.
متن کاملThe 2011 KIT English ASR system for the IWSLT evaluation
This paper describes our English Speech-to-Text (STT) system for the 2011 IWSLT ASR track. The system consists of 2 subsystems with different front-ends—one MVDR based, one MFCC based—which are combined using confusion network combination to provide a base for a second pass speaker adapted MVDR system. We demonstrate that this set-up produces competitive results on the IWSLT 2010 dev and test s...
متن کاملExploring Features for Text-dependent Speaker Verification in Distant Speech Signals
Automatic speaker verification (ASV) is the task of verifying a person’s claimed identity from his/her voice using a digital computer. The existing ASV systems perform with high accuracy of verification when the speech signal is collected close to the mouth of the speaker (< 1 ft). However, the performance of the ASV systems reduces significantly for speech signals collected at a distance from ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2012